Abstract:
Recently, image-text matching based on local region-word semantic alignment has attracted considerable research attention. The fine-grained interplay can be achieved by aggregating the similarities of the region-word pairs. However, the similarities of aligned region-word pairs are treated equally in most of the cross-modal matching literature, without considering their respective importance. Moreover, local alignment methods are prone to global semantic drift because they ignore thematic considerations for the image-text pairs. In this paper, a novel Dual-View Semantic Inference (DVSI) network is proposed to leverage both local and global semantic matching in a holistic deep framework. For the local view, a region enhancement module is proposed to mine the priorities of different regions in the image, which provides differentiated abilities to discover the latent region-word relationships. For the global view, the overall semantics of the image are summarized for global semantic matching to avoid global semantic drift. The two views are unified for final image-text matching. Extensive experiments conducted on MSCOCO and Flickr30K demonstrate the effectiveness of the proposed DVSI. (C) 2020 Elsevier B.V. All rights reserved.
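As a rough illustration of the weighted local aggregation plus global matching described above, the following PyTorch sketch reweights region-word cosine similarities by learned region priorities and mixes the result with a global cosine similarity. The function name, the softmax weighting, and the equal local/global mixing are illustrative assumptions, not details taken from the paper.

```python
# Minimal sketch (PyTorch) of a dual-view similarity; names and weighting are assumptions.
import torch
import torch.nn.functional as F

def dual_view_score(regions, words, region_scores, img_global, txt_global, alpha=0.5):
    # regions: (R, d) region features, words: (W, d) word features,
    # region_scores: (R,) learned region priorities (the "region enhancement" idea),
    # img_global / txt_global: (d,) summarized global features.
    regions = F.normalize(regions, dim=-1)
    words = F.normalize(words, dim=-1)
    sim = regions @ words.t()                                  # (R, W) region-word cosine sims
    w = torch.softmax(region_scores, dim=0)                    # region importance weights
    local = (w.unsqueeze(1) * sim).max(dim=0).values.mean()    # weighted local alignment
    glob = F.cosine_similarity(img_global, txt_global, dim=0)  # global-view similarity
    return alpha * local + (1.0 - alpha) * glob                # unify local and global views
```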
Abstract:
Image captioning is a task to generate natural descriptions of images. In existing image captioning models, the generated captions usually lack semantic discriminability. Semantic discriminability is difficult to achieve, as it requires the model to capture detailed differences between images. In this paper, we propose an image captioning framework with semantic-enhanced features and extremely hard negative examples. These two components are combined in a Semantic-Enhanced Module. The semantic-enhanced module consists of an image-text matching sub-network and a Feature Fusion layer, which provides semantic-enhanced features with rich semantic information. Moreover, in order to improve semantic discriminability, we propose an extremely hard negative mining method which utilizes extremely hard negative examples to improve the latent alignment between visual and language information. Experimental results on MSCOCO and Flickr30K show that our proposed framework and training method can simultaneously improve the performance of image-text matching and image captioning, achieving competitive performance against state-of-the-art methods. (C) 2020 Elsevier B.V. All rights reserved.
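For context, hard negative mining in image-text matching is often implemented as a max-violation triplet loss over a batch similarity matrix, as in the sketch below. The paper's "extremely hard" selection strategy may differ; this only shows the standard baseline the idea builds on, with illustrative names.

```python
# Hedged sketch of a hard-negative (max-violation) triplet loss; not the paper's exact method.
import torch

def hard_negative_triplet_loss(sim, margin=0.2):
    # sim: (B, B) image-to-caption similarity matrix; diagonal entries are the positives.
    pos = sim.diag().view(-1, 1)
    mask = torch.eye(sim.size(0), dtype=torch.bool, device=sim.device)
    cost_cap = (margin + sim - pos).clamp(min=0).masked_fill(mask, 0)      # image anchors
    cost_img = (margin + sim - pos.t()).clamp(min=0).masked_fill(mask, 0)  # caption anchors
    # Keep only the hardest (most violating) negative for each anchor.
    return cost_cap.max(dim=1).values.mean() + cost_img.max(dim=0).values.mean()
```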
Abstract:
Recently, image-text matching has been intensively explored to bridge vision and language. Previous methods explore an inter-modality relationship between an image-text pair from a single-view feature. However, it is difficult to discover all the abundant information based on a single inter-modality relationship. In this paper, a novel Multi-View Inter-Modality Representation with Progressive Fusion (MIRPF) is developed to explore inter-modality relationships from multi-view features. The multi-view strategy provides more complementary and global semantic clues than single-view approaches. In particular, the multi-view inter-modality representation network is constructed to generate multiple inter-modality representations, which provide diverse views to discover the latent image-text relationships. Furthermore, the progressive fusion module fuses inter-modality features stepwise, which fully exploits the inherent complementarity between different views. Extensive experiments on Flickr30K and MSCOCO verify the superiority of MIRPF compared with several existing approaches. The code is available at: https://github.com/jasscia18/MIRPF. (C) 2023 Published by Elsevier B.V.
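One way to read "progressive fusion" is a stepwise, gated combination of the per-view representations. The module below is a minimal PyTorch sketch of that reading; the gating design is an assumption, not the architecture actually used by MIRPF.

```python
# Illustrative stepwise (progressive) fusion of multi-view features; design is assumed.
import torch
import torch.nn as nn

class ProgressiveFusion(nn.Module):
    def __init__(self, dim):
        super().__init__()
        self.gate = nn.Sequential(nn.Linear(2 * dim, dim), nn.Sigmoid())

    def forward(self, views):                     # views: list of (B, dim) tensors
        fused = views[0]
        for v in views[1:]:
            g = self.gate(torch.cat([fused, v], dim=-1))
            fused = g * fused + (1 - g) * v       # fuse the next view step by step
        return fused
```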
Abstract:
Since locally controllable text-to-image generation cannot achieve satisfactory results in detail, a novel locally controllable text-to-image generation network based on visual-linguistic relation alignment is proposed. The goal of the method is to accomplish image processing and generation semantically through text guidance. The proposed method explores the relationship between text and image to achieve local control of text-to-image generation. Visual-linguistic matching learns similarity weights between image and text through semantic features to achieve fine-grained correspondence between local image regions and words. An instance-level optimization function is introduced into the generation process to accurately control the weights with low similarity and combine them with text features to generate new visual attributes. In addition, a local control loss is proposed to preserve the details of the text and the local regions of the image. Extensive experiments demonstrate the superior performance of the proposed method, which enables more accurate control over the original image.
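The similarity weights between words and image regions mentioned here could be computed as a normalized attention over region-word cosine similarities, along the lines of the sketch below. The temperature value and clamping are illustrative choices, not the paper's exact formulation.

```python
# Hedged sketch of word-to-region similarity weights; hyperparameters are assumptions.
import torch
import torch.nn.functional as F

def word_region_weights(words, regions, temperature=9.0):
    # words: (W, d), regions: (R, d); returns (W, R) weights indicating how strongly
    # each word corresponds to each image region (fine-grained correspondence).
    words = F.normalize(words, dim=-1)
    regions = F.normalize(regions, dim=-1)
    sim = (words @ regions.t()).clamp(min=0)       # keep only positive evidence
    return torch.softmax(temperature * sim, dim=-1)
```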
Abstract:
Image-text retrieval is a fundamental cross-modal task whose main idea is to learn image-text matching. Generally, according to whether there exist interactions during the retrieval process, existing image-text retrieval methods can be classified into independent representation matching methods and cross-interaction matching methods. The independent representation matching methods generate the embeddings of images and sentences independently and are thus convenient for retrieval with hand-crafted matching measures (e.g., cosine or Euclidean distance). As for the cross-interaction matching methods, they achieve improvement by introducing interaction-based networks for inter-relation reasoning, yet suffer from low retrieval efficiency. This article aims to develop a method that takes advantage of the cross-modal inter-relation reasoning of cross-interaction methods while being as efficient as the independent methods. To this end, we propose a graph-based Cross-modal Graph Matching Network (CGMN), which explores both intra- and inter-relations without introducing network interaction. In CGMN, graphs are used for both visual and textual representation to achieve intra-relation reasoning across regions and words, respectively. Furthermore, we propose a novel graph node matching loss to learn fine-grained cross-modal correspondence and to achieve inter-relation reasoning. Experiments on the benchmark datasets MS-COCO, Flickr8K, and Flickr30K show that CGMN outperforms state-of-the-art methods in image retrieval. Moreover, CGMN is much more efficient than state-of-the-art methods using interactive matching. The code is available at https://github.com/cyh-sj/CGMN.
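The efficiency argument for independent-representation methods is that each item is embedded once, so retrieval reduces to similarity lookups over precomputed vectors. The sketch below illustrates that retrieval step only; the names are illustrative and it is not CGMN's actual code.

```python
# Minimal sketch of retrieval with independently precomputed embeddings.
import torch
import torch.nn.functional as F

def retrieve(text_emb, image_embs, k=5):
    # text_emb: (d,) query embedding; image_embs: (N, d) precomputed image embeddings.
    text_emb = F.normalize(text_emb, dim=-1)
    image_embs = F.normalize(image_embs, dim=-1)
    scores = image_embs @ text_emb        # (N,) cosine similarities
    return scores.topk(k).indices         # indices of the top-k matching images
```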
Abstract:
Image-text matching is a crucial aspect of multi-modal intelligence. The main challenge in this area is accurately measuring the relevance between the image and text, using evidence obtained through matching. Previous studies either concentrated on obtaining a well-represented global feature to measure similarity directly or on investigating complex matching patterns at a local level before aggregating them, with little attention paid to combining the two. We propose a Globally Guided Confidence Enhancement Network that combines both approaches by obtaining a good global representation to guide fine-grained local interactions. In this process, content that better matches the text from a global perspective is enhanced and represented with confidence scores. Extensive experiments demonstrate that the proposed approach achieves superior performance on the Flickr30K and MSCOCO datasets.
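One plausible reading of the globally guided confidence mechanism is to score each region by its agreement with a global text representation and reweight the regions before local matching. The sketch below illustrates only that reading; the names and the softmax scoring are assumptions.

```python
# Hedged sketch: global-text-guided confidence scores that enhance matching regions.
import torch
import torch.nn.functional as F

def confidence_weighted_regions(regions, txt_global):
    # regions: (R, d) local region features, txt_global: (d,) global text feature.
    conf = torch.softmax(
        F.normalize(regions, dim=-1) @ F.normalize(txt_global, dim=-1), dim=0
    )                                               # (R,) confidence scores
    return conf.unsqueeze(1) * regions, conf        # enhanced regions and their scores
```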
Abstract:
Image-text matching aims to find the relationship between image and text data and to establish a connection between them. The main challenge of image-text matching is the fact that images and texts have different data distributions and feature representations. Current methods for image-text matching fall into two basic types: methods that map image and text data into a common space and then use distance measurements, and methods that treat image-text matching as a classification problem. In both cases, the two modalities used are image and text data. In our method, we create a fusion layer to extract an intermediate modality, thus improving the image-text processing results. We also propose a concise way to update the loss function that makes it easier for neural networks to handle difficult problems. The proposed method was verified on the Flickr30K and MS-COCO datasets and achieved superior matching results compared to existing methods. (C) 2021 Elsevier B.V. All rights reserved.
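A fusion layer that derives an intermediate representation from the two modalities could look like the minimal PyTorch module below. The concatenation-plus-MLP design is an assumption made for illustration, not the layer described in the paper.

```python
# Illustrative fusion layer producing an intermediate-modality representation; design is assumed.
import torch
import torch.nn as nn

class FusionLayer(nn.Module):
    def __init__(self, dim):
        super().__init__()
        self.proj = nn.Sequential(nn.Linear(2 * dim, dim), nn.ReLU(), nn.Linear(dim, dim))

    def forward(self, img_emb, txt_emb):    # both (B, dim)
        # The fused feature acts as an intermediate mode that can be matched in a common space.
        return self.proj(torch.cat([img_emb, txt_emb], dim=-1))
```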
Abstract:
In some scripts, especially the Farsi/Arabic script, letters normally attach together and produce many different patterns, some of which are fully or partially similar. Detecting such patterns and exploiting them to reduce the library size has a considerable effect on the compression ratio.
In this paper, a lossy/lossless compression method is proposed for bi-level printed text images in archiving applications. To this end, we propose a new 1-D pattern matching technique in the chain coding domain that detects repetitive sub-signals in order to identify fully or partially similar patterns.
Experimental results show that the compression performance of the proposed method is considerably better than that of existing bi-level printed text image compression methods, by as much as 1.8-4.2 times in the lossy case and 1.6-3.8 times in the lossless case at 300 dpi.
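For intuition about matching in the chain-coding domain, the small Python sketch below finds sub-signals of a 1-D chain code that occur more than once, which is the kind of redundancy a pattern library can exploit. The brute-force search and the length threshold are illustrative only and do not reflect the paper's actual algorithm.

```python
# Toy illustration: find repeated sub-signals in a 1-D chain code (not the paper's method).
def repeated_subsignals(chain_code, min_len=4):
    # chain_code: string of direction symbols (e.g. "01234567...") describing a contour.
    seen, repeats = {}, set()
    n = len(chain_code)
    for length in range(min_len, n // 2 + 1):
        for i in range(n - length + 1):
            sub = chain_code[i:i + length]
            if sub in seen and seen[sub] != i:
                repeats.add(sub)          # sub-signal occurs at least twice
            else:
                seen.setdefault(sub, i)
    return repeats
```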
Abstract:
(C) 2023 Elsevier Ltd. Image-text matching has become a research hotspot in recent years. The key point of image-text matching is to accurately measure the similarity between an image and a sentence. However, most existing methods either focus on the inter-modality similarities between regions in the image and words in the text or on the intra-modality similarities within image regions or words, so they cannot fully exploit detailed correlations between images and texts. Furthermore, existing methods typically train their models using a triplet ranking loss, which relies on the similarity of randomly sampled triplets. Since the weights of positive and negative samples are not adjusted, it cannot provide enough gradient information for training, resulting in slow convergence and limited performance. To address the above problems, we propose an image-text matching method named Bi-Attention Enhanced Representation Learning (BAERL). It builds a self-attention learning sub-network to exploit intra-modality correlations within image regions or words and a co-attention learning sub-network to exploit inter-modality correlations between image regions and words. Then, the representations obtained by the two sub-networks capture holistic correlations between images and texts. In addition, BAERL uses a self-similarity polynomial loss instead of the triplet ranking loss to train the model. The self-similarity polynomial loss can adaptively assign appropriate weights to different pairs based on their similarity scores so as to further improve retrieval performance. Experiments on two benchmark datasets demonstrate the superior performance of the proposed BAERL method over several state-of-the-art methods.
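The co-attention step mentioned here, in which regions and words build context-aware representations of each other, is commonly implemented along the lines of the hedged sketch below; BAERL's exact sub-network is likely more elaborate, and all names are illustrative.

```python
# Hedged sketch of a basic region-word co-attention step; not BAERL's actual sub-network.
import torch
import torch.nn.functional as F

def co_attention(regions, words):
    # regions: (R, d), words: (W, d). Each region is re-expressed as a weighted sum of
    # words, and each word as a weighted sum of regions, so the modalities inform each other.
    affinity = F.normalize(regions, dim=-1) @ F.normalize(words, dim=-1).t()  # (R, W)
    region_ctx = torch.softmax(affinity, dim=1) @ words        # (R, d) text-aware regions
    word_ctx = torch.softmax(affinity.t(), dim=1) @ regions    # (W, d) image-aware words
    return region_ctx, word_ctx
```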
Abstract:
Multi-modal machine translation (MMT) aims to use information from other modalities to assist text machine translation and obtain higher-quality translation results. Many studies have proved that image information can improve the quality of text machine translation. However, the multi-modal corpora used in the translation process require a lot of manual annotation, which makes them difficult to build, and the scarcity of such datasets limits multi-modal machine translation work to a certain extent. To solve the problem of text-image annotation, we propose a text-image similarity matching method. This method encodes the text and images, maps them to a vector space, and uses cosine similarity to retrieve the image most similar to the text in order to construct a multi-modal dataset. We conducted experiments on the Multi30K English-German text-only corpus and the WMT21 English-Hindi text-only corpus, and the experimental results showed that our method improved by 8.4 BLEU over the text-only translation results on the Multi30K corpus. Compared with manually annotated multi-modal datasets, our method improves by 4.2 BLEU. At the same time, it improved by 3.4 BLEU on the low-resource English-Hindi corpus, so our method can effectively support the construction of multi-modal machine translation datasets and, to some extent, advance multi-modal machine translation research.
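The matching step described above, retrieving for each sentence the most similar image by cosine similarity between embeddings, can be sketched as follows. The embedding model is left unspecified here, and the function name and return values are illustrative.

```python
# Minimal sketch of cosine-similarity text-image pairing for corpus construction.
import torch
import torch.nn.functional as F

def best_image_for_sentence(sent_emb, image_embs):
    # sent_emb: (d,) embedding of a source sentence; image_embs: (N, d) embeddings of
    # candidate images. The highest-scoring image is paired with the sentence.
    sims = F.cosine_similarity(image_embs, sent_emb.unsqueeze(0), dim=-1)  # (N,)
    return sims.argmax().item(), sims.max().item()
```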